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Process for detecting specific nucleotide variations and genetic polymorphisms present In nucleic acids and kits therefor. 

@ Single or multiple nucleotide variations in nucleic acid 

sequence can be detected in nucleic acids by a process 

whereby the sample suspected of containing the relevant 

nucleic acid is repeatedly treated with primers* nucleotide 

triphosphates, and an agent for polymerization of the 

triphosphates and then denatured, in a process which 

amplifies the sequence containing the nucleotide variation if 

it is present, tn one embodiment, the sample is spotted on a 
^ membrane and treated with a labeled sequence-specific 

oligonucleotide probe. Hybridization of the probe to the 
^ sample is detected by the label on the probe. 
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PROCESS FOR DETECTING SPECIFIC NUCLEOTIDE 
VARIATIONS AND GENETIC POLYMORPHISMS 
PRESENT IN NUCLEIC ACIDS 
AiND KITS THEREFOR 

This Invention relates to a process for detecting nucleotide 
. 5 variations, nutations and polymorphisms by anplifying nucleic acid 
sequences suspected of containing such mutations or polymorphisms and 
detecting them with sequence-specific oligonucleotides in a dot blot 
format. 

In recent years, the molecular basis of a number of human 

10 genetic diseases has been elucidated by the application of recombinant 
DNA technology. In particular, the detection of specific polymorphic 
restriction sites in human genomic DNA associated with genetic 
disease, such as sickle-cell anemia, has provided clinically valuable 
Information for prenatal diagnosis. In these studies, the presence or 

15 -absence of a specific site is revealed by restriction fragment length 
polymorphism (RFLP) analysis, a method in which variation in the size 
of a specific genomic restriction fragment is detected by Southern 
blotting and hybridization of the immobilized genomic DNA with a 
labeled probe. RFLP analysis has proved useful in the direct 

20 detection of polymorphic sites that contain the nutation conferring 
the disease phenotype (e.g.. Mstll and sickle-cell anemia) as well as 
In linkage studies where a particular allelic restriction site Is 
linked to a disease locus within a family but not necessarily In the 
general population. See. for example, Kan and Dozy. PNAS_ (USA). 75 

25 5631 (1978), and Rubin and Kan, Lancet. 1985-1. 75 (1985). See a^ 
Geever et al.. PNAS. (USA). 78, 5081 (1981) and Wilson et al.. PNAS 
(USA). _79. 3628 (1982). 

In a second method, called "oligomer restriction", a 
synthetic end-labeled oligonucleotide probe Is annealed in solution to 
30 the target genomic DNA sequence and a restriction enzyme Is added to 
cleave any hybrids formed.. This method, the specificity of which 
depends on the ability of a base pair mismatch within the restriction 
site to abolish or inhibit cleavage, is described more fully by Salkl 
et al., Biotechnoloqyr^, 3, 1008-1012 (1985). In addition, the 
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sensitivity of this technique may be enhanced by utilizing a 
polymerase .chain reaction procedure wherein the sain)le. Is first 
subjected to treatment with specific primers, polymerase and 
nucleotides to amplify the signal for subsequent detection. This Is 
5 described more fully by Salkl et al Science. 230, 1350-1353 (1985). 

A third method for detecting allelic variations which Is 
Independent of restriction site polymorphism utilizes sequence- 
specific synthetic oligonucleotide probes. See Conner et al., PNAS 
(USA). 80, 78 (1983). This latter technique has been applied to the 
10 prenatal diagnosis of ed -anti trypsin deficiency (Mdd et al.. Nature. 
304. 230 (1983)) and p-thalassemia (Pirastu et al., N. Engl. J. Med.'. 
309, 284 (1983)). In addition, the technique has been applied to 
study the polymorphism of HLA-DR& using Southern blotting (Angel 1n1 et 
al., PNAS (USA), 83, 4489-4493 (1986)). 

"The basis for this procedure Is that under appropriate 
hybridization conditions a short oligonucleotide probe of at least 19 
- bases (19-mer) will anneal only to those sequences to which it is 
perfectly matched, a single base pair mismatch being sufficiently 
destabilizing to prevent hybridization. The distinction between the 
20 allelic variants is based on the thermal stability of the duplex 
formed between the genomic DNA and the oligonucleotide (19-mer) probe. 

In addition, methods for detecting base pair mismatches in 
double-stranded RNA and RNA:DNA heteroduplexes have been described 
using pancreatic ribonuclease (RNase A) to cleave the 
25 heteroduplexes. Winter et al.. PNAS (USA), 82:7575-7579 (1985) and 
Myers et al.. Science. 230:1242-1246 (1985). The principal deficiency 
of this method Is its inability to recognize all types of base pair 
mismatches. 

Both the RFLP and oligonucleotide probe stability methods 
30 are relatively complex procedures, requiring restriction enzyme 
digestion, gel-f ractionatlon of the genomic DNA, denaturation of the 
DNA, immobilization of the DNA either by transfer to a filter menbrane 
or dessication of the gel itself, and hybridization of a labeled probe 
to the electrophoretically resolved array of immobilized genomic 
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restriction fragments. These steps are necessary, for the 
ollgonuclebtide probe stability method, due to the complexity of hunan 
genomic DNA. Restriction, and electrophoresis are necessary to 
separate the target sequence ("signal") from the rest of the genome 
("noise"), and hybridization in the gel (instead of filter transfer) 
is necessary to retain as much target sequence as possible. Even 
then, detection of a signal in a 10 pg sanple using a high specific 
activity kinased probe requires an autoradiographic exposure of four 
days. 
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In addition, the approach of Conner et al. requires at least 
a 19-mer probe for reasons of specificity (a shorter probe would 
hybridize to more genomic fragments), as well as possibly 
sensitivity. Shorter probes (e.g., 16^ners). however, would show more 
sequence-specific discrimination because a single mismatch would be 
15 more destabilizing. 

There is a need in the art for a simplified method with 
Improved sensitivity and specificity for directly detecting single- 
base differences in nucleic acids such as genomic DNA. 

Accordingly, the present invention provides a method for 
?o detecting single or nultiple nucleotide variations in nucleic acid 
sequence from any source, for use in detecting any type of disease or 
condition. The method herein directly detects the sequence variation, 
eliminating the need for restriction digestion, electrophoresis, and 
gel manipulations otherwise required. In addition, the method herein 
25 provides for improved specificity and sensitivity of the probe; an 
interpretable signal can be obtained with a 0.04 yg sanple in six 
hours. Thirdly, if the amount of sample spotted on a membrane is 
increased to 0.1-0.5 yg, non-isotopically labeled oligonucleotides may 
be utilized rather than the radioactive probes used in previous 
30 methods. Finally, the process herein is applicable to use of 
sequence-specific oligonucleotides less than 19-mers in size, thus 
allowing use of more discriminatory sequence-specific 
oligonucleotides. 
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Specifically, the present Invention provides for a process 
for detecting the presence or absence of at least one nucleotide 
variation In sequence In one -or more nucleic acids contained In a 
sample, which process conprlses: 

5 (a) treating the sample, together or sequentially, with 

four different nucleotide triphosphates, an agent for polymerization 
of the nucleotide triphosphates, and one oligonucleotide primer for 
each strand of each nucleic acid suspected of containing said 
variation under hybridizing conditions, such that for each nucleic 

10 acid strand containing each different variation to be detected, an 
extension product of each primer Is synthesized which Is conplementary 
to each nucleic acid strand, wherein said primer or primers are 
selected so as to be substantially conplementary to each nucleic add 
strand containing each different variation, such that the extension 

15 product synthesized from one primer, when It Is separated from Its 
complement, can serve as a template for synthesis of the extension 
product' of the other primer; 

(b) treating the sample under denaturing conditions to 
separate the primer extension products from their templates if the 

20 varlatlon(s) to be detected are present; 

(c) treating the sample, together or sequentially, with 
said four nucleotide triphosphates, an agent for polymerization of the 
nucleotide triphosphates, and oligonucleotide primers such that a 
primer extension product Is synthesized using each of the single 

25 strands produced 1n step (b) as a template, wherein steps (b) and (c) 
are repeated a sufficient number of times to result In detectable 
amplification of the nucleic acid containing the sequence 
varlatlon(s), if present; 

(d) affixing the product of step (c) to a membrane; 

30 (e) treating the membrane under hybridization conditions 

with a labeled sequence-specific oligonucleotide probe capable of 
hybridizing with the amplified nucleic acid sequence only if a 
sequence of the probe is complementary to a region of the anplified 
sequence; and 
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a..Hfi.d ''''' ^" ^^^"-dl^d to an 

amplified sequence In the nucleic acid sample. 

Steps (b) and (c) are preferably repeated at least fiv. 
times, and more preferably 15-30 times if the sample contains hi 
5 genomic DNA Tf ^ i ^«nihic contains human 

genome DNA. if the sample conprtses cells, preferably they are 
heated before step (a) to expose the nucleic acfds therein to the 

:eZr,.i»on"'' - -"p- 
- ""-o-^'^niho^rr.:;: rt:::-thrre:r!jr: 

sequence Is labeled. The labeled prlmer(s) and/or ^u^ll . 
tr«ate,s, can be present In the reaction Lturflnl 

n abe er,^'^ -ooence-speclfic olloonucleotid 

(unlabeled) .,s affixed to a membrane and treated under hybridization 
"conditions with the labeled anpllflcatlon product „ 
hybridization „1l, occur only ,f the .e.br,ne-bound sequence 
present 1n the anipllflcatlon product. 

In another embodiment, the Invention herein relates to a kit 
for detecting the presence or absence of at least one nucleotide 
20 variation „ sequence In one or more nucleic acids contained 1 
sample, which Mt comprises. In packaged form, a multlcontalner unit 

(a) one container for each oligonucleotide primer for each 

23 ^ret^d 'Z r'"' "Hatlon be n 

23 detected, which pr1mer(s) are substantially conple^nt.ry to each 

strand containing each different variation, such that an extenslo 
com lament can serve as a template for the synthesis of the extension 
30 nucleic acid sequences If the sequence var1at1on(s) are presentr 

(b) a container for an agent for polymerization; 

tH h . ' container for each of four different nucleotide 

triphosphates; 
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(d) one container for each labeled sequence-specific 
oligonucleotide capable of hybridizing with each possible sequence 
variation In the amplified nucleic acid sequence; and 

(e) contalner(s) for reagents vrtilch detect hybr'^dlzatlon of 
. 5 the probes to the amplified sequence. 

Regarding genetic diseases, while RFLP requires a 
polymorphic restriction site to be associated with the disease, 
sequence-specific oligonucleotides directly detect the genetic lesion 
and are generally more useful for the analysis of such genetic 

10 diseases as hemoglobin C disease, al -anti trypsin and B-thalassemla 
which result from single or multiple base mutations. In addition, the 
oligonucleotides can be used to distinguish between genetic variants 
which represent different alleles (e.g., HLA typing), indicating the 
feasibility of a sequence-specific ollgonucleoti de-based HLA typing 

15 kit. 

The term "nucleotide variation In sequence" refers to any 
single or multiple nucleotide substitutions, deletions or 
insertions. These nucleotide variations may be mutant or polymorphic 
allele variations. Therefore, the process herein can detect single 

20 nucleotide changes in nucleic acids such as occur In B-globin genetic 
diseases caused by single-base mutations, additions and deletions 
(some e-thalassemias, sickle cell anemia, hemoglobin C disease, etc.), 
as well as multiple-base variations such as are involved with a- 
thalassemla or some 6-thalassemias. The process can also detect 

25 polymorphisms, which are not necessarily associated with a disease, 
but are merely a condition In which two or more different nucleotide 
sequences (whether having substituted, deleted or inserted nucleotide 
base pairs) can exist at a particular site in the nucleic acid in the 
population, as with HLA regions of the human genome and random 

30 polymorphisms such as in mitochondrial DNA. The polymorphic sequence- 
specific oligonucleotide probes described in detail hereinafter may be 
used as genetic markers linked to a disease such as insulin-dependent 
diabetes or in forensic applications. If the nucleic acid is double- 
stranded, the nucleotide variation in sequence becomes a base pair 

35 variation in sequence. 
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The term "oligonucleotide" as used herein is defined as a 
^lecule co^Msed of two or .ore deoxyribonucleotldes or 
ribonucleotides, preferably more than three. Its exact size will 
depend on many factors, which in turn depend on the ultimate function 
5 or use Of the oligonucleotide. The oligonucleotide may be derived 
synthetically or by cloning. 

The term "primer" as used herein refers to an 
oligonucleotide, whether occurring naturally as in a purified 
restriction digest or produced synthetically, which Is capable of 

10 acting as a point of initiation of synthesis when placed under 
conditions in which synthesis of a primer extension product which is 
complementary to a nucleic acid strand Is induced, i.e in the 
presence of four different nucleotide triphosphates and an Igent for 
polymerization such as DNA polymerase In an appropriate buffer 

15 ( buffer" includes pH. ionic strength, cofactors. etc.) and at a 
suitable temperature. 

The primer is preferably single stranded for maximum 
efficiency in amplification, but may alternatively be double 
branded. If double stranded, the primer is first treated to separate 
Its strands before being used to prepare extension products 
Preferably, the primer is an oligodeo^yribonucleotide. The primer 
mst be sufficiently long to prime the synthesis of extension products 
in the presence of the agent for polymerization. The exact lengths of 
the primers will depend on many factors, including teiiperature and 

25 source of primer and use of the method. For exartple. depending on the 
complexity of the target sequence, the oligonucleotide primer 
typically contains 15-25 nucleotides, although it may contain more or 
fewer nucleotides. Short primer molecules generally require lower 
temperatures to form sufficiently stable hybrid conplexes with the 

30 template. 

The primers herein are selected to be "substantially" 
complementary to the different strands of each specific sequence to be 
amplified. This means that the primers must be sufficiently 
complementary to hybridize with their respective strands. Therefore 
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the primer sequence need not reflect the exact sequence of the 
template. For example, a non-complementary nucleotide fragment may be 
attached to the 5' end of the prtmer, with the remainder of the primer 
sequence being complementary to the strand. Typically, "the primers 
5 have exact complementarity to obtain the best detection results. 

The term "sequence-specific oligonucleotides" refers to 
oligonucleotides which will hybridize to specific sequences, whether 
or not contained on alleles which sequences span the nucleotide 
variation being detected and are specific for the sequence variation 
10 being detected. Depending on the sequences being analyzed, one or 
more sequence-specific oligonucleotides may be enployed for each 
sequence, as described further hereinbelow. 

As used herein, the term "thermostable enzyme" refers to an 
enzyme which is stable to heat and is heat resistant and catalyzes 

15 (facilitates) combination of the nucleotides in the proper manner to 
form the primer extension products which are complementary to each 
nucleic acid strand. Generally, the synthesis will be initiated at 
the 3* end of each primer and will proceed in the 5' direction along 
the template strand, until synthesis terminates, producing molecules 

-iO of different lengths. There may be thermostable enzymes, however, 
which initiate synthesis at the 5' end and proceed in the other 
direction, using the same process as described above. A purified 
thermostable enzyme is described more fully in Example VIII 
hereinbelow. 

25 The present invention is directed to a process for 

amplifying any one or more specific nucleic acid sequences (as defined 
herein to contain one or more nucleotide variations) suspected of 
being in one or more nucleic acids. 

In general, the present process involves a chain reaction 
30 for producing, in exponential quantities relative to the number of 
reaction steps Involved, at least one specific nucleic acid sequence 
given (a) that the ends of the required sequence are known in 
sufficient detail that oligonucleotides can be synthesized which will 
hybridize to them, and (b) that a small amount of the sequence is 
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available to InUl.te the chain reaction, product of the chain 

reaction, win be a discrete nucleic acid duplex h ell i 
corresponding to the ends of the specific primers e^lo,ed ' 

5 ut.„ H r'"' " "onpurlfled far», can be 

5 utihzed as the starting „„c,e1c acid or acids, prov.d d It 
suspected of containing the sequence being detected. ,,us t 
process «ay employ, for example. DNA or m. Including messenger' RNA 

Which DNA or m .ay be single stranded or double stranded n" 
addit, , , ^^^^^^ ^^ed. 

uti ized. A muture of any of these nucleic acids ..y aUo be 
employed, or the nucleic acids produced fro» a previous ajllficat.o 
reac ,on herein using the sa^e or different priors ly . 3 
utilized The specific nucleic acid sequence to be anpllfied .ay b 

13 d screte ^ ecule. so that the specific sequence constitutes h 
entire nucleic acid. 

It is not necessary that the sequence to be anpllfied be 
present Initially 1„ , pure for.; U ^y be , ™l„or fraTtlon of a 

20 . " ' °' ""t>'"ed In 

20 Who e h^an DNA. or a portion of nucleic acid sequence due to 
particular microorganism which organism night constitute only , ,er» 
minor fraction of a particular biological sample. The startlna 
nucleic acid may contain more than one desired specific nucleic acid 
sequence which may be the same or different, ^erefore. the present 
25 process Is useful not only for producing large amounts of one s ec1f"c 
nucle.c acid sequence, but also for amplifying simultaneously more 
than one different specific nucleic acid sequence located on the same 
or different nucleic acid molecules If more than one of the base pair 
variations in sequence is present. 

The nucleic acid or acids nay be obtained from any source 
for example, from plasmids such as pBR322. from cloned DNA or RNA or 
from natural DNA or RNA fron, any source, including bacteria, yelst 

DNror'Rr""?' "^"^"^ '''' " ^^-^^ - -^-1-' 

DNA or RNA may be extracted from blood, tissue material such as 
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chorionic villi or amniotic cells by a variety of techniques such as 
that described by Manlatis et al., Molecular Cloning (1982). 280-281, 

The cells may be directly used without purification of the 
nucleic acid If they are suspended In hypotonic buffer and heated to 
5 about 90-100'*C, until cell lysis and dispersion of intracellular 
components occur, generally about 1 to 15 minutes. After the heating 
step the amplification reagents may be added directly to the lysed 
cells. This direct cell detection method may be used on peripheral 
blood lymphocytes and amnlocytes, 

10 Any specific nucleic acid sequence can be amplified by the 

present process. It Is only necessary that a sufficient number of 
bases at both ends of the sequence be known In sufficient detail so 
that two oligonucleotide primers can be prepared which will hybridize 
to different strands of the desired sequence and at relative positions 

I^ along the sequence such that an extension product synthesized from one 
primer, when 1t Is separated from Its template (cortplement) , can serve 
as a template for extension of the other primer Into a nucleic acid of 
defined length. The greater the knowledge about the bases at both 
ends of the sequence, the greater can be the specificity of the 

-J primers for the target nucleic acid sequence, and thus the greater the 
efficiency of the process. 

It will be understood that the word "primer" as used 
hereinafter may refer to more than one primer, particularly in the 
case where there is some ambiguity in the information regarding the 

25 terminal sequence(s) of the fragment to be amplified. For instance, 
in the case where a nucleic acid sequence Is inferred from protein 
sequence Information, a collection of primers containing sequences 
representing all possible codon variations based on degeneracy of the 
genetic code will be used for each strand. One primer from this 

30 collection will be homologous with the end of the desired sequence to 
be amplified. 

The oligonucleotide primers may be prepared using any 
suitable method, such as, for example, the organic synthesis of a 
nucleic acid from nucleoside derivatives. This synthesis may be 
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performed in solution or on a solid support. One tvo^ nf 

prepare gene fragments or short genes In thp nh.c ^ . , "^'^^"^ to 

. 5 „„g.. „„„e,c adds. For a description of this method el ,r 
S- et al.. Heth. Enzw».l.. 68, 90 (1979) and us 

4 356 57n Tk. I TT^ — ' Patent No. 

4.356.270. The patent describes the synthesis and cloning of the 
somatostatin gene. ^ 

A second type of organic svnthesi«: ic u 
.0 pod. WMCK Has .een „tnUed\o preple"; t TZT" 
et al Heth. Enzymol.. 68, 109 (1979) for a description of thli 
method. AS in the phosphotrlester method, this phosphod ester ItH 

nvo ,es synthesis of ollgonucUotldes that are subsequently Jo 
together to form the desired nucleic acid. 

15 Automated embodiments of these methods may also be 

employed I„ one such automated embodiment, dlethylphosphor^ldltes 
are used as starting materials and may be synthesized as described bv 
Beaucage et al.. Tetrahedron Letters. (1981), 12:1859-1862. One method 
for synthesizing oligonucleotides on a modified solid support 1s 

^0 described In U.S. Patent No. 4.458.066. ,t Is „so possible to use 
primer which has been Isolated from a biological source (such as a 
restriction endonuclease digest). 

The specific nucleic acid sequence Is produced by using the 

.3 cL::; t"""" " • " - - 

acid cont Ins two strands. It Is necessa^ to separate the strands of 
he nucleic acid before It can be used as the te„,late. either as a 
separate step or s1«ltaneously with the synthesis of the primer 
ex ens on products. This strand separation can be acco^llshed by any 
sunable denaturing method Including physical, chemical or enz^atlc 
30 means One physical method of separating the strands of the nuT I c 
acd involves heating the nucleic acid until It Is completely (>99») 
denatured. Typical heat denaturatlon may Involve tenperatures rangino 
rom about 80 to 105-0 for times ranging from aboot 1 to 10 mlnu e ' 
Strand separation may also be Induced by an enzyme from the class o^ 



